Propagation of Uncertainty Through Multilayer Perceptrons for Robust Automatic Speech Recognition
نویسندگان
چکیده
Observation uncertainty techniques offer a way to dynamically compensate automatic speech recognizers to account for the information missing in real world scenarios. These techniques have been demonstrated to effectively be able to compensate multiple environment distortions and improve the integration of ASR systems with speech enhancement pre-processing through uncertainty propagation. Unfortunately observation uncertainty techniques rely on statistical methods and as such are limited to GMM-HMM architectures. In this paper we explore the application of observation uncertainty and uncertainty propagation techniques to multi-layer perceptrons (MLPs). We develop solutions for propagation through a generic MLP and exemplify potential gains with an large vocabulary robust ASR experiment on the AURORA4 database using an Hybrid MLP-HMM recognizer.
منابع مشابه
Training Multilayer Perceptrons with the Extende Kalman Algorithm
A large fraction of recent work in artificial neural nets uses multilayer perceptrons trained with the back-propagation algorithm described by Rumelhart et. a1. This algorithm converges slowly for large or complex problems such as speech recognition, where thousands of iterations may be needed for convergence even with small data sets. In this paper, we show that training multilayer perceptrons...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملDesigning and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملUncertainty Decoding for Noise Robust Automatic Speech Recognition
This report presents uncertainty decoding as a method for robust automatic speech recognition for the Noise Robust Automatic Speech Recognition project funded by Toshiba Research Europe Limited. The effects of noise on speech recognition are reviewed and a general framework for noise robust speech recognition introduced. Common and related noise robustness techniques are described in the contex...
متن کاملA MMSE estimator in mel-cepstral domain for robust large vocabulary automatic speech recognition using uncertainty propagation
Uncertainty propagation techniques achieve a more robust automatic speech recognition by modeling the information missing after speech enhancement in the short-time Fourier transform (STFT) domain in probabilistic form. This information is then propagated into the feature domain where recognition takes place and combined with observation uncertainty techniques like uncertainty decoding. In this...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011